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Abstract. I n thi s arti cl e, a seri es of experi ments wi 1 1 be descri bed i n whi ch 
the expressiveness of maps is shown to be different from the one we find for 
texts. The hypothesis is that types of geographical information exist that can 
be stored i n and read from a text, but that are i mpossi bl e to express as geo- 
graphical maps without significant loss of meaning. The article shows how 
the hypothesi s was supported by a seri es of model ling experi ments. 
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1. Introduction 

The ai m of this study is to understand the expressiveness of texts i n the area 
of geographical information better, using topographical maps as a yardstick 
with which to compare texts. What can a text say, and how is it different 
from what a map can say? The text used as the source for this study is a 
published volume of border protocols from 1742-1745 (Schnitler 1962). 
This article is based on my 2012 PhD from King's College London, Depart- 
ment of Digital Humanities. 

The creator of the protocols, Peter Schnitler, initiated a number of court 
meetings documented in his protocols. Schnitler did not only write the text, 
he also created the situation in which many of the events described in the 
text took place. The text documents situations of ordinary, skilled commu- 
nication about geography, in line with what Mark, Smith et al. (1999) de- 
scribes. According to the vocabulary developed in the field of regional geog- 
raphy, Schnitler's role was that of a gatekeeper, mediating among a number 
of different fol k geographi es exi sti ng at the same ti me: 



[W]ithin any region, there are multiple geographies, varieties of geo- 
graphical knowledges. ...And there are varieties of folk geographies - 
those inspired by diverse belief systems and ethnic traditions, as well 
as those which have emerged from diverse genres de vie farmers, 
fishermen, nomads, merchants and poets (Buttimer, Brunn et al. 
1999: BO; italics and missing punctuation in original) 

Schnitler also made his own observations during his travels, adding to the 
experience he already had. In this way, he became an inner voice along with 
the witnesses as wel I as the outer voi ce of professi onal knowl edge. 1 1 may be 
that this mix between being an inner and an outer voice made him more 
respectful towards the witnesses than a reporter working only from the out- 
side would have been. 

Because of the complexity of the historical and cultural background, Schnit- 
ler (1962) became a very rich textual expression of detailed geographical 
information. It is highly descriptive, and includes a number of different 
perspectives, covering various professions and ethnic groups. As such, it is a 
good obj ect of study for my experi ments. 



2. Method 

The method applied in this research is experimental modelling as it is used 
in digital humanities. Models are representations of something which are 
created for the purpose of studying what is modelled more closely (M cCarty 
2005). I studied the expressiveness of the source text by manipulating a 
model of it. Most of the experiments were performed in a qualitative man- 
ner, in line with traditional research in the humanities in which knowledge 
about specific instances of a textual phenomenon is in the first instance the 
main goal (Galey2010). Counting and compari ng occurrences i n the model 
to find results were done in only a few cases, and the quantitative methods I 
di d use were quite si mpl e. 

The modelling was based on the pretence that I knew nothing about the 
landscape described in Schnitler (1962), except for the descriptions in the 
text itself. I pretended not to have seen any map and not to know the area, 
or even general features of Scandinavian landscapes. I pretended not to see 
most of the context in order to get at what could be read out of the text 
alone. Avoiding context is impossible in a strict sense, but I limited the con- 
text taken into consideration as much as possible. 

The process of model I i ng consists of extracti ng readi ngs of si ngl e asserti ons 
made in the text. When a number of such assertions are formalised and 
interlinked with one another, a surprisingly complex structure is estab- 



lished, showing how much there is to even quite commonplace processes 
and assertions. This also shows how incomplete our knowledge is, and how 
easily we skip over details in order to get to the big picture. The computer 
application developed for the modelling, GeoModelText 1 offers rich ma- 
nipulatory power over the interlinked set of assertions. This power has been 
used to pinpoint details in how the text works and how it is different from 
topographical maps. 

The choice of the verbalised form, that is, 'modelling' rather than 'model', is 
no accident. The computational model as a fixed structure of knowledge 
was not a goal in itself, but rather a series of temporary states in a process 
of comi ng to know. The poi nt of a model ling exerci se I i es i n the process, not 
in the model as a product. The use of a computer sharpens the distinction 
between a model and a concept. The model invites us to manipulate it, as it 
is set up as an interactive system. I nteraction with the computer is doubled 
when computer programming is a part of the research. In addition to the 
interactivity one has as a user of a system, one also has the interaction of a 
developer with the computer. 

GeoModelText is not meant to be used for a full analysis and modelling of 
all information, or even all geographical information, in the source text. It 
has been created in order to analyse enough to make i nferences about how 
the source text works i n the context of the hypothesi s. 

2.1. Overview of the modelling stages 

The modelling process consisted of five main stages. The first stage is the 
text, imported as a digital document. The next stage is the primary model, 
consi sti ng of a set of statements i n a form i nspi red by ontol ogi es as they are 
used in digital cultural heritage, e.g., CI DOC-CRM (CI DOC 2011). 

Sentences I and 1 1 are exampl es of such statements i n the form of tri pi es. I 
expresses the fact that the text clai ms that north of the place referred to by 
the name "Rovola" there is a place referred to by the referring string "a val- 
ley". Sentence II expresses the textual claim that the valley from I is ap- 
proximately one-quarter milewide. 

(I) Schnl_8936 (Rovola) -> direction: north -> node48 (a valley) 



(II) node48 (a valley) — > has-width — > node49 (someVi mile) 



1 URL: http:// qeomodel .sourceforqe.net/ (checked 2012-11-15) 



Based on the primary model, the formalised model is developed. The pro- 
cess from primary to formalised model consists of bringing all the state- 
ments to a similar level of explicitness. The next stage is the vector data, 
where each of the places referenced above, the ones represented by 
"Schnl_8936" and "node48", are placed in a mathematical vector space 
based on the relationships between them. Such vector data can be ex- 
pressed as topographical maps, which isthefinal stage. 

2.2. Stepwise formalisation and fall-off 

A system of stepwi se formal i sati on i s central to the model ling method I use, 
and the computer tool GeoModelText is implemented to support it. I will 
show a simple example in order to give an overview of the process. It starts 
with the short sentence in the "text" column of table 1 and ends with the 
i 1 1 ustrati on of map vi sual i sati on i n f i gure 1 



Text 


Primary model 


Formalised model 


Vector 


Map 


Some Vi mile 
east of A is B 


Unknown mile: 0.25 
Direction: east 


2 kilometres 
Direction: 90 


A= (0; 0) 
B = (2000;0) 


Figure 1 



Table L Example of stepwise formalisation from text to map. 



Different types of expressions will 
vary in how much they are changed 
from one stage to the next i n the for- 
mal i sati on process. In some cases 
there is not much added in formalisa- 
tion from text to primary model. 
"East" is an example Of this. The di- Figure L Graphical representation of the 
recti ons are kept as words in the pri- model in table I showing how the points 
mary model, formalised into English would be put on a map. Scale in meters, 
expressions following a system of 15 

directions: north, north- north- east, north-east, etc. Other types of expres- 
sions need more formalisation at this stage, such as distance. The number is 
reformatted from a fraction to a decimal number, and the type of mile is 
expressed explicitly. The fact that it is not clear from the textual context 
what type of mile we have is expressed in the name used in the primary 
model, "unknown mile". 

In proceeding from pri mary to formalised model, a recalculation of the dis- 
tance from miles to kilometres is done. In the eighteenth century, an official 
mi le i n Norway was 113 km. M i les used by common people were i n general 
shorter, down to the half of the official, so one unknown mile can reasona- 
bly be seen as 8 kilometres. As for thedirection, the word "east" istranslat- 
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ed into a number, in this case, 90°. I n order to reach the stage of the formal- 
ised model, I had to make i mportant choices, for the distance as well as the 
direction; they could both have been chosen otherwise. This is a key point 
to which I will return below. 

Moving on from the formalised model to vector data, the distance and the 
di recti on are taken together. We choose to put A at the ori gi n of the vector 
room. Once that is done, and as B is 90° from A, and 2 kilometres away, the 
coordinate for B gives itself. Figure 1 shows the vector data as they would 
have been expressed on a topographical map. 

I nformation from real chunks of text "travels" through this process, and the 
method documents what is lost on the way, as well as what must be added 
in order to reach the goal. Some types of information are lost from one 
stage to the next because it is impossible to include that information in the 
version of the model at the next stage. It "falls off". I will use the phrase fall - 
off to refer to this, both asa verb (to fall off) and asa noun (a fall-off). 

Through this process I translate a set of geographical information from one 
medium to the other. The series of fall-offs shows me what is lost in the 
process. The process is necessarily iterative and includes a close human- 
machine interaction. For each small step, the model becomes a little bit 
more formal. I n the process, the fall -off —what is difficult to avoid losing— is 
the interesting parts. The fall-off will include things that cannot survive a 
transfer from one stage to the next, which misses the different level of for- 
mality. This includes, for example, a direction such as "east". In a system 
where one must add a specific number of degrees, such as 90, the openness 
of the word "east" falls off. 

Modelling in this project is transitive, so the map is a model of the text. In 
each step of the stepwise formal isati on, the next stage can be seen as a 
model of each of the previ ous stages, so that the map i s a model of the text. 

H owever, we must remember that even if a map is a model of the text, it is 
far from the only one possible. There are always other choices that can be 
made, from the simple level of the choice of a value at a specific step all the 
way to the overall methodological choices. Even if something falls off in the 
modelling process from text to map, it does not follow that it cannot be ex- 
pressed as an element of a map. It could be that it cannot make it through 
my system of formal models even if it could have been expressed in a topo- 
graphical map. So the list of fall-offs is not a list of textual features that 
cannot be expressed in maps, but rather a list of candidates for further 
study of types of expressi on that cannot make it to the map. 

The automatic vector data generation of GeoModelText was not devel- 
oped to a level where vector datasets, and thus maps, could be created for 



anything more than a handful of statements. Thus, the map creation system 
was only used to demonstrate principles at a simple level, not to document 
how maps could be made for stretches of text such as a full interview. 



3. Case studies 



I n this section, case studies that demonstrate some of the results found in 
my PhD research will be presented. 



3.1. Witness statements 

Povel 1 sen's witness statement (Schnitler 
1962: 141- 143) amounts to 13 paragraphs 
with a total of 771 words. It includes some 
80 references to places, referring to 47 
different places. 

Looki ng at the form of the pi aces shown i n 
table 2 first, it is clear that the level of 



only two of the 47 places, and in those two 
cases only length, not width, is given. 
The general direction (eg., that a lake 
stretches from northeast to southwest) is 
also given for two places. There is almost 
no information in the text as to what the 
pi aces I ook I i ke, and thus, few cl ues tel I i ng 
us how they should be drawn. Some size 
restrictions are given by the relationships 
to other places, but only in a very vague 
way. In some cases, other indications re- 
lated to both the form of a place and the 
relationships to other places are given, 
such as rivers connecti ng to lakes at speci- 
fied places. But only a few such indica- 
tions are given, and they are quite vague. 

M ost of the 47 pi aces do have spati al con- 
necti ons to other places, but the spatial specificity varies. Table 3 shows the 
different connection types and the number of connections for which each of 
them are specified. When both direction and distance are given, we have 
the most specified relationships. Part of and between are less specific, 
spatially speaking. But these are differences in grade; all the relationships 



Places 


Type 


Count 


Relative 


Total number 


47 


1 00.0 % 


Has length 


2 


4.3 % 


Has width 





0.0 % 


Has direction 


2 


4.3 % 


Table 2. Statistics for form of places. 


Relationships between places 


Type 


Count 


Relative 


Total number 


48 


1 00.0 % 


With direction 


12 


25.0 % 


With distance 


13 


27.1 % 


Part of 


17 


35.4 % 


Between 


14 


29.2 % 



Table 3. Statistics for relationships 
between places. The sum of the differ- 
ent types of relationships is higher than 
the total number of relationships be- 
cause many of the relationships have 
both direction and distance specified. 



between places are underspecified. An example showing how underspecifi- 
cation works can be seen in figures 2 and 3, where two different maps based 
on the same textual description, taken from (Schnitler 1962: 14]), are 
shown. The different rules of interpretation on which they are based are: 

1 South is interpreted as 180° (fig. 2) or 150° (fig. 3). 

2. Default width and length of a place polygon when no measurements 
are given: 4000*6000 (fig. 2) or 1000*500 (fig. 3) meters. 

3. Default distance in X and Y direction of something between some- 
thing else, when distance and direction are not given: 500*200 (fig. 2) or 
1000*2000 (fig. 3) meters. 

4. The length of a mile when type is not specified: 8000 (fig. 2) or 
6000 (fig. 3) meters. 



•Deris Nyebyggerplatzer 




1000 2000 



*Leefba(c)ken 



Figure 2. Map based on textual description, version 1 Scale in metres. 



Some non-spatial relationships can also be put on a topographical map; 
eg., if two places are related only by both being farms, the relationship can 
be expressed by the use of the same symbol on the map. However, if there is 
no spatial relationship indicating where one is located relative to the other, 
the places cannot be put on the same map in any other way than by guess- 
ing the location of one of them relative to the other. The fact that both of 
them are farms tells us that they can be represented by the same symbol, 
but not their spatial relationship. 



•Deris Nyebyggerpbtzer 



•Fiostviigen 



> Gwern berger 
len 






1000 


2000 





Figure 3. Map based on textual description, version 2. Scalein metres. 

I n a communication situation it is common to mention a well-known place 
without specifying where it is in relation to other places. This will still give 
its location to an informed reader. A place name should, when it is used 
within the group of people knowing it, "instantly evoke ...the idea of one 
particular place through an association by c o n t i g u i t y" (Olsen 1928: 5; 
highlighted in original). 

The place names in Schnitler (1962) give the locations of the places through 
such associations, and through the locations thus given, spatial relations to 
other places are also implied. These relations are not found in the text, but 
in the knowledge of the reader. The writer is also a reader under this per- 
spective: he is thefi rst reader. Some of the readers, even the writer, may not 



know where some of the places are. Then they will have to fall back on the 
spatial relations given by the text in order to get an idea of the whereabouts 
of the places based on other places whose location they do know. In my 
modelling, asituation where the reader knows about none of the places re- 
ferred to by the place names is simulated. There is no way to choose one 
and only one correct way of expressing what we are able to put on a map 
based on thetext only. 

This overall pattern is repeated throughout the text. There are, however, 
some room for variation. Another witness, 01 e Nilsen, shows somewhat 
higher levels of connectedness. I n his interview, there are a total of 215 ref- 
erences to places (Schnitler 1962: 150- 155). These 215 expressions refer to 
115 different places. The total number of relationships between places is 
155. The specif i city of the rel ati onshi ps is hi gher than the one we found for 
Povel. The percentages of distance and part of are more or less the same, 
but there are more directions at the expense of between here, which gives a 
higher level of specificity. 

When we look at descriptions of places, the difference is even clearer. 
Length, width and direction are all used for more than 10% of the places in 
Ole's testimony, compared to less than 5% for Povel 's. So 01 e describes 
form more than Povel. Still, even 01 e gives no such descriptions of form for 
almost 90% of the places. This indicates that even if there is room for some 
variation in descriptions of the form of places, the level is still low. The wit- 
nesses depend on the instant evocation of the particular place in the mind 
of the reader— perhaps not consciously, but at least such dependence is part 
of thei r way of expressi ng themselves. 



3.2. Route descriptions 


The road to J emtd and the ordi nary one goes from 0stbye Farms to 




0ye mountai n can be esti mated 


1 new mile, 


from therethrough a little birch forest by the western end passing by 




the Lake Eesand i n the east to Remmen 


lmile 


from therethrough a little bi rch forest to olvaa-Kol 


lmile 


from here through the forest to Handol the 2 first farms in J emteland 


2 miles 


New 


5 miles — 



Figure 4. English translation of a route description (Schnitler 1962: 54). 



Route descriptions are common in the text. I n the route description from 
Norway to Sweden shown in figure 4, the directions of the roads are not 
given in the text. If the reader takes the wider context into consideration, 
the general direction would have been known, because Sweden is generally 
east of Norway in this area. But this is not ex- 
pressed in the text fragment. I n line with what we 
saw above we cannot make a topographical map 
representing the route based on this information 
alone; several choices have to be made. We would, 
for instance, have to choose an exact direction for 
each of the relations. 

I nstead of trying to express the model of this text 
fragment as a topographical map, I tried a topo- 
logical map instead. The text of the route descrip- 
tion does not contain enough information to pro- 
duce one specific topographical map, but it turned 
out that one and only one topology is readable 
from it; thus, two significantly different topologi- 
cal maps cannot be made, as long as we see "sig- 
nificantly different" as "expressing different to- 
pologies". 

The text does not as much describe a landscape, 
which would be translatable to a topographical Figur 5. a topological map 

map, as a network, which is translatable to a topo- f based ° n ^° me ° f the , text in 
, . . . ,. ._ . . i ■ i ■ figure 4. Topological maps 

logical map. In figure 5, such a topological map is ar a e usuaNy not jj^ed, ^ 

shown. In addition to what we usually have in thisoneis. 



<S) 0ye-ficld 
11.3 km 

(j >) Remmen 
11.3 km 

(»>)olvaa-K0l 

22.6 km 
® Handel 



topological maps, the text also informs us about the relative distances, so 
that the topological map in this case can be made to scale; the road between 
one pair of places is double the length of the two other roads. But there is a 
difference between the topological connections between the places, on one 
hand, and the scale, on the other. The former is precisely as it was described 
in the text, whereas the latter has the inaccuracy inherent in all distances in 
the text, and thus, the interpretation can be chosen to be different. 

Route sketch maps have previously been shown to be based on the same 
mental models as textual route descriptions (Tversky and Lee 1999). That 
confirms the similarity between a textual description and a topological map 
shown here. A topological map is a good way to express a network with only 
some spatial constraints; thesamecan be said of a route description. 



4. Conclusion 

When creating maps based on a text using previously existing maps and 
local knowledge, it is often doable to express information from the text un- 
ambiguously on the map. But in that case one locks the text to one geo- 
graphical reading. This may be problematic for historical texts. While I do 
not propose to use the method described here in normal historical text 
studies, I would suggest the results are taken as a further evidence of how 
much of what we read is based on the knowledge with which we meet the 
text. Many choices must be made, also when the choices are linked to a 
choice of which map to use. 

For most spatial features, there is more or less leeway in how they can be 
interpreted if we rely on the text alone. A statement giving only a very lim- 
ited leeway would be "A is exactly 103 km from B, in a direction of exactly 
87.6°". There are no such statements in our source text, the relationships 
are significantly I ess specified if expressed at all. However, the semi otic sys- 
tem of a topographical map callsfor a high level of precision, which must be 
added for the map to be created. 

Although there are ways to at least partially tackle the problems described 
in this paper, such as fuzzy logic (Petry, Cobb et al. 2005), the differences 
between texts and maps still remain. Fuzzy forms on maps are commonly 
used for thematic mapping, but less so in historical mapping of topograph- 
ical features other than for inherently fuzzy objects such as mountains. It 
may take the development of new map schemata (MacEachren 2004: 198) 
to change this. And the question remains to what degree new ways of un- 
derstandi ng maps can bridge the gap between the semi otic systems of texts 
and maps. 



The results described here call for further research. I n order to understand 
textual underspecifi cation better, more texts from a variety of cultures 
should be studied, in light of previous textual studies (Eide Forthcoming 
2013). Further, better methods for modelling should be implemented in 
order to deepen our understanding of what can be expressed in the differ- 
ent media. Dynamic maps should also betaken into consideration. 
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